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Summary Coalescent processes, including mutation, are derived from Moran type popula- 
tion models admitting large offspring numbers. Including mutation in the coalescent process 
allows for quantifying the turnover of alleles by computing the distribution of the number of 
original alleles still segregating in the population at a given time in the past. The turnover of 
alleles is considered for specific classes of the Moran model admitting large offspring numbers. 
Versions of the Kingman coalescent are also derived whose rates are functions of the mean and 
variance of the offspring distribution. High variance in the offspring distribution results in higher 
turnover and younger age of alleles than predicted by the usual Kingman coalescent. 
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• 1 Introduction 

X 

The age of an allele is central to understanding the nature of extant variation. The first mathe- 



matical treatment of the age of alleles were done by Kimura and Ohta ( 1973 1, Maruyama ( 1974 1, 



and Maruyama and Kimura (19751. In particular, Kimura and Ohta (19731 find, using Kimura 



(1964)'s diffusion model, that alleles in low frequency in the population have surprisingly old 



expected ages. 
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More recently, coalescent techniques have been applied to the study of the age of an allele 



(Donnelly and Tavare 1986 Griffiths and Tavare 1998 Saunders et al. 1984 Stephens 2000 



Tavare 1984 Watterson 19841. Two main extensions to the coalescent were obtained by Saun- 



ders et al. ( 1984 1 One is the analysis of the ancestry of nested subsamples, which, for example, 



yields the probability that a sample shares its' most recent common ancestor with the whole pop- 
ulation. The other extension is the inclusion of neutral mutation in the ancestral process, which, 
in turn, yields the probability that the oldest allele in a population is included in a sample. These 



extensions are reviewed by Tavare (1984). Donnelly and Tavare (1986) consider age ordering of 
alleles in a sample, and obtain, for example, the distribution of the number of representatives 



of the oldest allele in a sample, a result originally due to Kelly (19771. The ancestral processes 



derived by Watterson (1984), Saunders et al. (1984), Tavare (1984) and Donnelly and Tavare 



( 1986 ) were all based on the usual Moran ( 1958 1962 1 model of reproduction. The Moran model 
belongs to a large class of exchangeable population models, originally introduced by |Cannings| 



(1974). The Moran, and indeed the celebrated Wright-Fisher (Fisher 1930 Wright 1931) model 
can be characterized as low offspring number models by allowing individuals to have very many 
offspring with only negligible probability. 



Population models admitting large offspring numbers with non-negligible probability (Eldon 



and Wakeley 2006 Huillet and Mohle 2011 Sargsyan and Wakeley 2008 Schweinsberg 2003) 



have recently been proposed as appropriate for organisms with high fecundity and high initial 



mortality, characteristics of many marine tax;a (Arnason 



2004 



Beckenbach 



1994 



Hedgecock 



1994 Hedgecock et al. 1982), and possibly forest trees (Ingvarsson 2010). The gene genealogies 



of samples drawn from populations admitting large offspring numbers are characterised by mul- 



tiple mergers of ancestral lineages (Donnelly and Kurtz 1999 Mohle and Sagitov 2001 Pitman 



1999 Sagitov 1999 Schweinsberg 2000), or in which any number of active ancestral lineages 



may coalesce to the same common ancestor. In comparison, the Kingman coalescent (Kingman 



1982a|b I allows at most two lineages to coalesce each time. One consequence of the multiple 



merger property is that exact computations of many quantities of our interest can only be done 
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recursively. 

The age and rate of turnover of alleles in a population admitting large offspring numbers is 



(1984 


) and 


Tavare 


(1984 



of nested subsamples drawn from a modified Moran model population, in which the number 
of offspring contributed each timestep to the population is random. The rate of turnover of 
alleles associated with the Kingman coalescent derived from populations with different offspring 
distributions is considered, as well as probabilities of a subsample carrying the oldest allele of a 
sample drawn from populations admitting large offspring numbers. 

2 A population model of overlapping generations 

Consider a simple haploid population models of overlapping generations, in which at each 
timestep a single individual (the parent) chosen uniformly at random from the population con- 
tributes offspring to replace those who perished in that timestep, so the population size stays 
constant at N. The coalescent processes obtained from these population models will then be 
used to study the coancestry process of nested subsamples. Let U denote the random number 
of offspring produced by the parent, who always persists. In the usual Moran model, the parent 



always has one offspring (U — 1 a.s.). Eldon and Wakeley (20061 consider a simple mixture dis 



tribution, in which U = I with probability 1 — e,,, and U takes value \_ipN\ with probability e^^, 



in which ^ G (0, 1) is a constant. Eldon and Wakeley (20061 use this simple model to illustrate 
the idea that population models admitting large offspring numbers may be more appropriate for 
high fecundity organisms than the usual Wright-Fisher and Moran models. A clear drawback of 



the model of Eldon and Wakeley (20061 is that -0 is a constant, which means that the parent 
always has exactly the same number of parents, even when a large offspring number event oc- 
curs. Two simple extensions of the usual Moran model will now be considered. Our aim is to 
incorporate, in a natural way, the randomness inherent in the number of offspring contributed 
each timestep. 

We will work with two models. First, let us set notation. Let X be a random variable with 
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probability distribution Px, in which ¥[X > 1] = 1, and finite moments, sup^E[X'^] < oo for 
all fc > 1. Let y be a random variable taking values on the unit interval. Finally, let _B be a 
Bernoulli random variable with P[_B In model 1, we take U to be a simple mixture 

of X and Y (assuming YN e N) : 

U ^ X{1- B) + {NY)B (1) 

In model 2, we let U he a Poisson random variable with random mean M, in which M is a 
mixture of X and Y, M = X{1 - B) + {NY)B. 

The idea behind the mixture distribution is the assumption that most of the time the parent 
contributes only few offspring to the population due to restrictions on resources such as space. 
Occasionally, however, there is disturbance in the environment, such as a storm or a forest 
fire. The disturbance wipes away a fraction Y of the population, allowing NY offspring to take 
foothold in the population. For simplicity, we assume that the difference between the number of 
individuals that perish, and the number of offspring replacing them, is negligible. These ideas are 



similar to the ones proposed by Schweinsberg (20031, who models the distribution of 'potential' 



offspring, rather than the ones that actually take foothold in the population. However, Lemma 6 



Schweinsberg ( 2003 1 establishes a link between the transition probability of the ancestral 



process and the distribution of the potential offspring. The timescale in Schweinsberg (2003)'s 
model depends on a parameter of the distribution of the potential offspring. The timescale in our 
model will be independent of the distribution on [/. A similar idea of extending the usual Moran 



model is also raised by Birkner and Blath ( 2009 1 . One can also view the mixture distribution as 



a simple model of a bottleneck followed by a rapid growth, or recolonization. Indeed, [Grant and| 



Bowen ( 1998 ) suggest populations of sardines and anchovies may have been sucbject to periodic 
bottlenecks and recolonizations. 

Convergence of the coalescent process (i?m;m € Nq) arising from both models is given in 
Appendix, as proof of convergence relies on standard theory. The coalescent process {Rm \ m e N) 
without mutation describes the random ancestral relation among a set of n individuals drawn 
at random from the population at time to = 0. A state of the coalescent process Rm is, by 



definition, tlic random equivalence relation containing individuals only if i and j share a 
common ancestor at timestep r in the past. One can think of the coalescent process as following 
labelled (or enumerated) ancestral lines back in time. 

Time in our model will be in units of T/v = N'^ ^ ^n^' ^-"^^ ^^^^ therefore be independent of 
any parameters associated with the offspring distribution on X and Y . Thus, if N'^Sj^ — > then 
large offspring number events occur with negligible probability in a large population, and the 
coalescent process admits only mergers of two active ancestral lineages. Otherwise the ancestral 
process admits multiple mergers of active ancestral lineages. 

For our purposes it suffices to consider the gene genealogical process [A^f^; t > Oj that counts 



the number of ancestors, in which yl*^^^ is associated with model £ S {1, 2}. To describe the rate 
matrix of A^^\ write 

/?i = E[X(X + 1)] 

(2) 

p2 = 2E[X] + E[X^] 



If A^^Ejv — > large offspring number events are negligible and the rate matrix Qi = (^l^^j^ 
associated with model i is given by 



i,jeln] 



^]-{-i:2)P^ ifj' = * (3) 
otherwise. 



If N'^Ej^ tends to a constant 4> one obtains rate matrix Qi given by 

■(^)(/3, + 0E[r2(i-rr2]) ifj = j-i 



:)E[y'=(i - y)*-'^] ifj = i-fc + i 



3 < fc < i (4) 
-Q/3£ - 4>-E[l - (1 - YY - iY{l - Yy-^] iij^i 
otherwise. 

Finally, if N'^s^ — > oo large offspring number events are dominating and the rate matrix Q has 
entries 

■(^)E[y'=(i ifj = i-fc + i 

_ 2<k<i 
^ ^ -{1-E[{l-Yy + iY{l-Yy-'^]) ifj^i 
otherwise. 
In the present work we will primarily be concerned with three examples of a A coalescent. One 



is the process derived by Schweinsberg ( 2003 ) , in which A is the probability measure associated 



ith the beta distribution, for 1 < a < 2, 



A{dx) = 



'{l-x)°'-'^dx. 



T{2-a)T{a) 

In theory one can include the case a = I. When a = I, however, the A measure is simply 
the uniform distribution on [0, 1], which may not be biologically realistic. For comparison, a is 



estimated to be around 1.5 (Eldon 20111 for data on Atlantic cod (Arnason 2004). Another 



example of a A coalescent we will consider is when A is a scaled point mass at some point 
tp G (0,1), A{dx) — x^S.^dx. The point mass process is the process derived by Eldon and 



Wakeley ( 2006 1 . The third example we consider is when Y takes the beta distribution with 



parameters a and (3, in which case the rate ^ for j < i of the generator Q in Equation (jSj) 
takes the form, with 2 < k < i, 

_ B{k + a,b- k + a) 

in which •) is the beta function. There is no particular biological reasoning behind the 
choice of the beta distribution, other than Y takes values between zero and one. Allowing Y 
to be random seems a more realistic assumption than requiring Y to be fixed at some point 
between zero and one. 

Turning to the rate matrix (|3]), it differs from the one associated with the usual Kingman 
coalescent by the factor (3^. To compare our process to the Kingman coalescent, let E[T] denote 
the expected value of the time to the most recent common ancestor associated with the Kingman 
coalescent, and with E[r^^^] we denote the same quantity associated with model £. Similarly, 
let E[5] denote the expected value of the total size of the gene genealogy associated with the 
Kingman coalescent, and E[S'^^)] the same quantity associated with model £. One immediately 
obtains 



E[tW] = ^E[T] = 1(1 - 1/n), 
Pi Pe 

E[^W] = i-E[5] = -f J^i, i=l,2. 



(6) 



Hence, populations with offspring distributions whose mean and variance are large are predicted 
to have lower genetic diversity, as measured in number of segregating sites, than populations with 



smaller mean and variance, if they have equal rates of mutation. The unit of time of a standard 
Moran model population, or of populations associated with either model 1 or 2 is the same when 
N'^e^ 0, or N"^ timesteps. Thus, the results in Equation ([6| are directly comparable for 
populations with different offspring distribution. To further compare the different population 
models, let Fg(t}, Frpa), Fg, and Ft denote the cumulative density functions of the random 
variables S''^\ T^^', S, and T, respectively. Elementary calculations now give Fg(i-j{s) = Fs{Pes), 
and Frp{e) (t) = FT{liit) (s, i > 0). In wor ds, the probability that S'('') and T^'^) are no larger than 
some given values s and t, respectively, equals the probability that S and T are no larger than 
Pis and Pit, respectively. 

A further comparison between the usual Moran model, and models 1 and 2, can be made 
by considering the expected age of a mutation with frequency x G (0,1). In a Moran 

population, 

-9-7- 

(7) 



—2x 

E[C.] = - log(x), 



1 - X 



(Griffiths and Tavare 1998 Kimura and Ohta 19731. The argument of Griffiths and Tavare 



( 1998 ) can be adapted to obtain an expression of E[Ca;] in a population with reproduction following 
models 1 or 2, to give 

(8) 



PI 

Indeed, let Cn.b denote the age of a mutation in b copies in a sample of n individuals drawn from 
a usual Moran population, and Q ^ the corresponding quantity when drawn from a population 
with reproduction following model 1 or 2. Now, 



in which E[C„_f,] is given by Equation (5.4) in Griffiths and Tavare (19981. Hence, by taking 
appropriate limits as in Griffiths and Tavare (19981, one arrives at Equation ([8|. Hence, the age 
of mutations will tend to be overestimated if Equation ([7]) is applied to estimate mutation age 
in a population with high mean and/or variance in the offspring distribution. 
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3 Turnover of ancestral lineages 

The question on the age of extant genetic variation can be addressed using coalescent methods. 



Tavare ( 1984 1 and Donnelly and Tavare ( 1986 1 consider the coalescent process including muta- 
tion, in which the underlying reproduction mechanism is the usual Moran model. Each time a 
mutation event occurs, one lineage mutates into a 'new' class, and is hence no longer considered 
ancestral to the sample. This is equivalent of a lineage starting a new line of descent going 
forward in time. 

To study the turnover of ancestral lineages, we will work with the lineage counting process 
[Ar] r G No) including mutation. Since we allow mutation, the process Aj. takes values in {n\ = 
{0, 1, . . . ,n} if the initial number of lineages is n. When a mutation occurs on the last of the 
original n lineages, the process A^ reaches the absorbing state zero. The way mutation is included 
in the coalescent process is explained in Appendix. Weak convergence of ^i^^TjvJ to a continuous- 
time process At follows from convergence of the coalescent process R including mutation (see 
Appendix). If iV^e^^ — > the rate matrix Q = (q. j) j jg{„} associated with process At has entries 

. ^U{B+{i^l)fit)/2 if j=i-l, l<i<n 

in which j3i was defined in Equation ([2|. If A^^e^ — > (f) for some constant </) > 0, the rate matrix 
Q has entries 

l)/3£ + 6')/2 + 0(*)E[r2(i _y)«-2] if j = i - 1, 1 < i < n 

0(^)E[r'=(l - Yy-''] ii j = i-k + l,i<k<n 

- l)l3e + 0)/2- <j)E[l - (1 - YY - iY{l - Y)'-^] ii j = i 
otherwise 

(10) 

Finally, if N^s^ — ?> oo the entries of the rate matrix are 

{ie/2 + QE[Y^{1 - Yy-^] if j = i - 1, 1 <i <n 

(^)E[r'=(l-r)'=-2] ifj = z-fc+l, 3<fc<n 

-ie/2-E[i-{i-Yy-iY{i-Yy~^] aj^i 

otherwise. 

The rate of turnover of alleles is quantified with the probability hij{t) = F[At ~ j\Ao = i] 
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given by 



k=j 



J = « 



(12) 



= 1- ^ Kj{t), j^o 

i<i<i 



in which r^''' and ^^'^^ are the right and left eigenvectors, respectively, of the rate matrix associated 
with A. Only when the rate matrix of A is of the form ([9]), a closed form expression for the 
eigenvectors can be obtained as follows. Define, for some constant c > 0, the factorials 

a(c.fe) = a(a + c)(a + 2c) ■ ■ ■ (a + (k — l)c), 

ajc^fe] = a(a — c)(a — 2c) • • • (a — (A: — l)c). 

The results of ' 



(13) 



Tavare 



(1984) can be adapted to obtain the left fS''^ and right r'*^^ eigenvectors 



of the transition matrix given by Equation (joj). One obtaines t^^"^ = Sj,Q, and 



j 

p{k) _ 



V.?/ {ck + 9)(c,k-i) 



j>k>l; 
< k 



(14) 



in which c = Bg. In the same way, rf''^ — 1 for all j, and 



(fc) 



0, j < /c 

k) [cj + e)(c,k)' 



(15) 



Equations ( 12 15 1 now give, with i^^ = i(i — 1) ■ ■ ■ (i — k ~\- 1), 



((2fc„l)c+0)^f±%^, l<J<^. (16) 



i[k] 



One can also view the constant c as a population size scaling constant. The probability hj{t) 
that i of original lines of descent from the whole population are still present at time t is given 

by 



(17) 



'{c.k-l) 

c^k\ ' 



obtained by taking i oo. The formulas for hi_j(t) and hj{t) associated with the usual Kingman 



coalescent with mutation are recovered by taking c = 1 . Even though the sum in Equation (17) 
is infinite, the terms quickly become small. Even moderate values of 9 and c (= Pe) lead to quick 
turnover of alleles in the population (Table 1). 

The distribution of the number of ancestral lineages of either a finite sample, or, of the whole 



population, at any given time, is straightforward to obtain using the results of Littler (19751, 



Grifiiths (19791, Griffiths (19801, Watterson (19821, Kingman (1982b), and Tavare (1984). The 



form of the eigenvectors associated with the usual Kingman coalescent is the same as when 
associated with the modified coalescent derived from models 1 or 2; only the eigenvalues of the 
corresponding rate matrices differ in an obvious way. 

An explicit form of hj(t) is hard to obtain when associated with a A coalescent since the 
rate matrix is triagonal, and closed form expressions for the eigenvectors are not easy to obtain. 
The eigenvectors can, however, be computed recursively, as shown in Appendix. The probability 
hiQ^t) that all of i initial ancestral lineages have vanished by time t is graphed as a function of 
time t in Figure 1 for two examples of a A coalescent. In Figure la is the point mass process 



K{dip) = ip'^S^d^l) studied by Eldon and Wakeley (2006), and in Figure lb is the beta coalescent 
derived by Schweinsberg (2003). In the beta coalescent, time is in units proportional to N"~^. 
Hence, the results in Figure lb must be interpreted with the timescale property of a in mind. 
The tp parameter of the point mass process is not a timescale parameter; the values of hio{t) 
for different values of rjj can therefore be directly compared. The quantity hiQ^t) can also be 
interpreted as the probability that the oldest mutation in a sample of i sequences is not older 
than t time units. Figure Ic reports values of hio{t) when the random variable Y has the beta 
distribution with parameters (a, /3). The values of hiQ(t) in Figure Ic are directly comparable for 
different values of the parameters a and /3 since they are not timescale parameters. If 9 is large, 
ancestral lineages will vanish quickly from a population admitting large offspring numbers, one 
concludes from Figure 1. In addition, the effects of 9 seem stronger than those of the coalescence 
parameters, for the range of parameter values considered in Figure 1. 
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4 The subsample process 

Two alleles drawn at random from a standard Moran population have probability 1 /3 of sharing 
their most recent common ancestor with the whole population present at the time the two alleles 
were sampled. This result follows from the probability (j — + + l)(i — 1)] that j of i 

lineages share a most recent common ancestor with the i lineages ( [Saunders et al.[ [1984 1 . This 



result indicates that even samples of even moderate size will yield a good estimate of the time to 
the most recent common ancestor of the whole population. To further study the rate of turnover 
of alleles in populations admitting large offspring numbers, we consider simple properties of the 
gene genealogical process of nested subsamples. 

One can recursively compute the conditional distribution of the number of ancestors of the 
subsample, given the number of ancestors of the sample, as shown in section [7.5[ Closed- form 



expressions can be obtained for a population with the Kingman coalescent (Griffiths 1980 



Kingman 1982b Littler 1975 Tavare 19841. The presence of multiple mergers makes closed- 
form expressions hard to obtain for a general A coalescent. In this section we will be concerned 
with the probability that a subsample shares it's most recent common ancestor with the sample. 
Denote by Al(u) the number of ancestors of the sample when the the number of ancestors of the 
subsample first reaches a given value u. Similary, denote by A2{v) the number of ancestors of the 
subsample when the number Ai of ancestors of the sample first reaches some value v. Closed-form 
expressions for the distributions of A^ and A2 have been obtained for a usual Moran population 



(Saunders et al. 19841. The conditional distributions of A^ and A2 when associated with a A 
coalescent may be computed recursively, as shown in Appendix. 

To compute our quantity of interest, let ■ipij{u,v) = F[Al{v) — u\Ai{0) = 1,^2(0) — j]. 
The probability ■0^ ^(1,1) that the subsample shares its' most recent common ancestor with 
the whole sample is straightforward to compute recursively as follows. Writing i' = i — k + I, 
f = {j - ^ + 1)1{<!>0} +il{£=o}, one obtains 

i jAk fj\ 

^..,(l,l) = ^/3(z,OE^^7Tp^^^'./(l'l) (18) 

fe=2 £=0 \k) 
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with the boundary conditions = 1 and = for z > 1. The result obtained 

from the usual Moran model, 1) = (j — l)(i + l)/[(j + l)(i — 1)], can be recovered from 



Equation (18l. The recursion (18l simplifies considerably when j = 2. Writing ■(/;(*) = "01,2(1, 1), 



one obtains 

^{i) = ^(*' ^-k + l) - fe + 1) + Pit, 2) ^ + 1). (19) 

When associated with a A coalescent, can be quite small (Table [2]). For comparison, the 
corresponding values when associated with the Kingman coalescent are ■0(500) = 0.335, and 
■0(1000) = 0.334. An indicator of how well a sample represents the population it is drawn from 
may be taken as the probability 0ooj(lj !)• A sample of moderate size drawn from a standard 
Moran population represents the population quite well, since then ipao.ji^, 1) = (j — l)/(j + !)• 
The results in Table ([2| indicate that only a large sample drawn from a population admitting 
large ofi^spring numbers has a good chance of sharing its' most recent common ancestor with the 
whole population, if we interpret the sample as the whole population. 

The conditional distributions of Al and A2 will be the same for models 1 and 2 as for the 
usual Moran model. Hence, samples of size j drawn from populations following a large class of 
reproduction law, or from populations differing in their population size, will all have probability 
(j — l)/(j + 1) of sharing their most recent common ancestor with the population present at the 
time of sampling. 

5 The subsample process including mutation 

The age of an allele is fundamental to population genetics. A sample of size j has probability 
j /{j + 9) of including the oldest allele in the population, if drawn from a usual Moran population 



(Saunders et al. 1984 Tavare 1984 Watterson and Guess 1977). Slatkin (20001 bases a test 



for selection on allele age. Results concerning age of alleles can be obtained by considering the 
ancestral process of nested subsamples when the ancestral process includes mutation. 

One might wish to know how the expected number of ancestors of a sample changes given 
the number of ancestors of a subsample. Define and A2 in the same way as Al(a) and ^2(0), 
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respectively. The quantity ^*(u) denotes the number of ancestors of the sample, including 
mutation, when the number of ancestors of the subsample first takes value u. The conditional 
distributions of and can be computed recursively (see Appendix) when associated with 
a population admitting large offspring numbers. The conditional expected value £[^2(0^)] is 
graphed as a function of the number of ancestors of the sample for three examples of a A 
coalescent. The idea behind calculating £[^2(0^)] is to infer how the number of ancestors of a 
sample changes relative to the whole population, by treating the sample as the population. The 
number of ancestors of a sample drawn from a population admitting large offspring numbers 
would then, on average, be expected to be smaller than if drawn from a usual Moran population, 
for a given number of ancestors for the whole population. Furthermore, mutation rate appears 
to matter little for the Kingman coalescent, while being a more important player for the A 
coalescent examples we are considering (Figure 2). The results in Figure 2 suggest that the 
expected number Ea[A2] of ancestors of the subsample when associated with a A coalescent 
would tend to be smaller than the corresponding value £^[^2] associated with the Kingman 
coalescent. In addition, Ea[^2] be convex in some cases, while Ei<-[A2] is a concave function 
of the number of ancestors of the sample. In Figure 3, values of the number Ea[^*] of ancestors 
of the sample when associated with a A coalescent are graphed as a function of the number of 
ancestors of the subsample. The results in Figure 3 show that Ea[^i] may be either smaller or 
larger than the corresponding value Ek ] associated with the Kingman coalescent. In addition, 
while Ek[^i] is a convex function of the number of ancestors of the subsample, Ea[^i] seems to 
range from being convex to appearing an almost linear function of the number of ancestors of 
the subsample. 

An expression for the probability that a subsample of a sample drawn from a usual Moran 
model population carries the oldest allele of the sample yields a remarkably simple expression 
for the probability that the sample carries the oldest allele of the population. The probability 
u{i, j) = (0, 0) that a subsample includes the oldest allele of a sample drawn from a population 
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admitting large offspring numbers is straightforward to compute recursively, namely 

(20) 

+ X! /^(*' * - ^) X! ^ -k,j- ^l{£>i}) 

fe=l f=0 Vfc+lj 

with boundary probabilities u(j,j) = 1, and 'u(«,0) = if z > 0. Table 3 shows values of 
?i(1000, 2) varying over 9 and tt. The corresponding values for the Kingman coalescent are 



m(1000,2) = 0.667 when 61 = 1, and ?i(1000,2) = 0.287 when 6^5 ( |Kelly[|1977| [Saunders et al. 



1984). The results in Table 3 indicate that two lineages drawn from a population admitting large 
offspring numbers will, in some cases, have quite a low probability of being the oldest allele of 
the population, if we interpret the sample as the population. One might also be interested in 
how quickly the probability of the subsample carrying the oldest allele of the sample increases 
with the subsample size. Figure 4 reports values of i2(150,j) as a function of j for the three 
example A coalescents. While u{i,j) is a concave function of j, and thus increases quickly with 
i when associated with the Kingman coalescent, u(i,j) increases slower, or almost linearly in 
some cases, when associated with a A coalescent. Thus, a subsample of a sample drawn from 
a population admitting large offspring numbers has smaller probability of carrying the oldest 
allele of the sample than if the sample was drawn from a usual Moran population. 

The conditional distributions of and A2 derived from models 1 or 2 will be different from 
the distributions derived from the usual Moran model. Denote the conditional distribution of 
A* hy^ih,£2), 

0(^1,^2) = ¥{Al{h) = £2|ii(0) = I, ^2(0) - j). 



Saunders et al. (1984) obtain 0(^1,^2) by solving the recursion (Theorem 6 in (Saunders et al 



1984| )) 

^(f , 1 . ^ (4-^2 + l)(4+^2 + g) , , , , , ,, (^2 + l)(4 + g) 

with (j)(i,£2) = 1 if ^2 = j, and (f)(i,£2) — otherwise. In a population following model 1 or 2, 
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one obtains 



(22) 



where we subpress the subscript ton [3 {— Pi), and 6* ~ 6 / j3. Hence, the conditional distribution 



of A2 is given by Theorem 6 in Saunders at al. ( 1984 1, with 9 replaced by 9/ j3. The same applies 
to the conditional distribution of A\ (see Theorem 7 in 



Saunders et al. 



( |1984[ )). In this context 

we must mention the scaling of mutation. We insist that mutation scale in units of iV^ timesteps 
exactly, in order for the models to be comparable. Hence, the mutation rate 9 is the same 
quantity for all models that yield a Kingman like coalescent. Thus, the probability that the 
oldest allele of i lineages is among a subset of j lineages is + 9)/[i{j/3 + 9)], when associated 
with model 1 or 2. The oldest allele of a population associated with model 1 or 2 will therefore 
be among j lineages with probability jP/{j/3 + 9). The constant [3 can also be interpreted as a 
population size scaling constant, if we assume that time is scaled in units of N"^ timesteps. If 
the size of a Moran population is scaled by factor c, then (3 — \ jf? . 



Kelly (19771 and Saunders et al. (19841 obtain the distribution of the number of representa- 



tives Fi of the oldest allele in the sample of size i drawn from a population with the usual Moran 



reproduction. Using the same argument as in Saunders et al. ( 1984), one can recursively compute 



the distribution of Fi, using Equation (20 1. The expected value E[Fi] of when associated with 



a A coalescent tends to be smaller than when associated with the Kingman coalescent (Table 4) . 
The magnitude of the difference also varies with the type of the A coalescent, with values of E[i^i] 
associated with the beta coalescent being quite similar to the ones associated with the Kingman 
coalescent. In contrast, E[i^i] seems to converge to one as V' decreases. The expected value of Fi 
also tends to one when 9 increases, when associated with the usual Kingman coalescent. When 
associated with model 1 or 2, and large offspring number events are negligible, the results of 



Kelly (19771 and Saunders et al. (19841 yield 



Var[J^,] 



/3(g + /3)g(z-l) 
(/? + 0)2(2/3 + 0)' 
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6 Discussion and Conclusion 

The Wright-Fisher and Moran models have been the basis of the majority of work in theoretical 
population genetics. However, they make strong assumptions about the offspring distribution of 
individuals, that are not fulfilled by most natural populations. The simple generalized versions 
of the Moran model developed in the present work allow one, in a natural way, to account for 
at least some of the stochasticity inherent in natural populations. A further benefit is that the 
modified population models run on the same timescale, and are thus directly comparable. 

Population models admitting large offspring numbers are natural candidates for populations 
with highly fecund individuals, including a diverse group of marine taxa. In this work we 
consider a model in which a single individual contributes a random number of offspring at each 
timestep. An important open question is the distribution of the number of offspring, which will 
require insight into the biology and ecology of natural populations. Accurately modeling offspring 



distribution may be particularly important in conservation genetics (Amos and Balmford 2001 1. 
By incorporating the mean and variance of the offspring distribution into the coalescence rates, 
we show that populations with high variation in their offspring distribution will have lower genetic 
diversity than predicted by the usual Moran model, as is often observed in particular among 
marine populations. Admitting large offspring numbers leads to multiple merger coalescent 
processes, which can predict the starlike pattern of haplotypes often observed among marine 



fishes ( Grant and Bowen 1998 ) 



The multiple merger coalescent processes derived from population models admitting large 
offspring numbers are more difficult to handle mathematically than any version of a Kingman 
coalescent. Closed-form analytic expressions are hard to obtain. Exact computations of many 
quantitites can therefore only be done recursively when associated with multiple merger pro- 
cesses. Recursive computations limit us in some cases to small sample sizes. Important insights 
can still be obtained, and which can give clues to results for large sample sizes. 

In this work we consider the rate of turnover of alleles in populations while paying attention 
to the offspring distribution. Our main conclusion is that the rate of turnover of alleles increases 
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with the variance in the offspring distribution. This means that alleles will tend to be younger 
than in the usual Wright-Fisher or Moran populations. In addition, our results suggest that one 
may need large sample sizes to accurately predict the age of the most recent common ancestor 
of a population in populations with large offspring variance. In addition, one may need to draw 
a large sample from some populations admitting large offspring numbers before having a good 
chance of having caught the oldest allele of the population. 

The author was supported in part by EPSRC grant EP/G052026/1, in part by DFG grant BL 
1105/3-1, and by a Junior Research Fellowship at Lady Margaret Hall, University of Oxford. 
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7 Appendix 

7.1 Weak convergence of the coalescent process including mutation 

In this section we establish weak convergence in the function space Dg,{[0,oo)) of the coales- 
cent process including mutation and derived from a general Cannings Cannings ( |1974 1 model 
admitting large offspring numbers. In a Cannings model of a population of constant size N, the 
offspring variables v. are exchangeable, and i/-^ + ■ ■ ■ + — N. By convention, let -D^([0, oo)) 
denote the space of right continuous functions with left limits on [0, oo) with values in E, the set 
of all equivalence relations on {1, . . . ,n}. The coalescent process (^m),^^^^ without mutation 
describes the random ancestral relation among a set of n individuals drawn at random from the 
population at time m = 0. A state ^ of the process (i?m)mgNo ^■^ definition, the random 
equivalence relation containing individuals only if i and j share a common ancestor at 

timestep r in the past. The coalescent process starts in state {(«,«) : 1 < i < n}. The absorbing 
state is {(«, j) : 1 < J < n}. Let 6 = fei + • • • + 6a, in which 6i, . . . , 6q denote the number of 
classes of that merge into each of a classes of rj. Write (r7i)„ = 171(771 — 1) • • • (m — n + 1), 



(to)o = 1. Under an exchangeable Cannings (19741 reproduction model and fixed population 



size N, the transition probability ,j(A^) = ¥[Rra+i = 7j\R„i — ^] from state ^ to 77 is given by 



Pi AN) 



iN)t 



(23) 



Time is scaled in units of l/c^^ , in which c^^ = E[i'j (v-^ ^1)]/ ^^)- To pass to a continuous-time 
process, we require that limAr_j.oo c„ = 0. 

We will now list the conditions on the population model under which the scaled process 



{R\^) 1) converges weakly to (i?t),^„ (Sagitov 



1999 ) . Denote by A a finite measure on the 



Borel subsets of the unit interval. The three conditions are 



E[{v,-lf]^o{N); 



N 

hm — > Nx] 



y ^A{dy), < X < 1; 



(24) 



N- 
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( |Sagitov[p99| . Now define 



The conditions in Equation p4| yield 



—p^,^{N)-^ x^-^il-xy-^Aidx), k>2, a>l 



(25) 



(26) 



(Sagitov 1999 1, in which bi — k, b2 ~ ■ ■ ■ ~ ba ~ I- 



The coalescent process (^m)„gjj with mutation separates each relation into mutant ('new') 
and non-mutant ('old') equivalence classes. The terminology 'old' and 'new' is to remind that 



mutations are always to new types (alleles), and hence start new lines of descent (Donnelly and 



Tavare 1986 Tavare 1984). Only non-mutant equivalence classes either coalesce or mutate. Let 



E denote the set of all equivalence classes on {1, . . . ,n} in which each class is partitioned into 



old and new classes (see Mohle (1999 1). The total number of old classes of the current state ^ is 



b + m. The transition probability for the transition from ^ to r/ is 

paN) = /i™(i - M J'l^E [{^x ■ ■ ■ 



(27) 



To retain mutations in the limit process, we require that the mutation probability scales 
according to c„ , namely 

^, 0<6i<oo. (28) 



lim 

N—>-oo C 

in which 6 is a constant. By ^ ^ r/ denote a transition due to mutation, in which one old class of 
^ becomes a new class of ij. By ^ ^ rj denote a transition from ^ to 77 in which S, d ij and rj denotes 
a merger of at least two of the equivalence classes of Define a sequence (5„ — {q^iS,, v))n ^eE 
of generators with entries 



9«(?,?y) 



'.[i/-^ ■ ■ ■ v^] if ^ ^ ?7 



-El. 



(N) 



if ^ = ?7 



otherwise. 



(29) 
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Equations ( 26 29 1 now imply that the sequence Q^^ of generators converges entry- wise to a 



generator Q — {q{S,,'r]))^^ with entries 

7o x'=-2(l-x)l«l-'=A(dx) 



e/2 



if ^ 77 
if ^ ^ ?7 



-leii - /o (1 - (1 - ^O'^l - - x)l«l-i)x-2A(d:r) if = C 



(30) 



^0 otherwise. 
Since the state space E is finite, the entry-wise convergence Qjv ^ Q inipHes weak convergence 



in D^{[0,oo)) of Ryt/c^\ to [Rt',t> O) with generator Q, by Theorem 17.25(i) in Kallenberg 



(1997) 



7.2 Generators of the coalescent processes 

In this section we describe the generators of the coalescent processes obtained from models 1 
and 2. 

By E denote the set of all equivalence relations on {1, . . . , n}. One readily obtains convergence 

of the finite-dimensional distributions of Rm- To prove convergence of the time-scaled process 

R^t^Nl the space De[0, 00) of all functions on [0, 00) with values in E that are right continuous 

and with left limits, we apply the convergence result of the previous section and obtain that the 

ancestral process (^R^iTn] ! ^ 5^ O) when associated with population model 1 or 2 converges weakly 

in De[0,oo) to a continuous-time process {Rt',t > 0) whose rate matrix Q depends on the limit 

N'^s^. To describe the transition matrices, define a,, associated with model (. £ {1,2}, 

a, = E[X^] 

(31) 

a, = E[X{X + 1)] 

The constants play similar role as the constants /3e associated with the gene genelaogical 
processes. Let |^| denote the number of classes (lineages) in relation ^. If N'^e^ — >■ the rate 
matrix 



Qi = i^f]^ associated with model I £ {1:2} is given by 



if e 77 
otherwise. 



(32) 
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in which f -< 77 now denotes a merger of at most two equivalence classes of ^. The usual Kingman 
coalescent is obtained from ( 32 ) by taking X in ([T]) to be a constant with value 1 . If large offspring 



number events are moderately frequent, ie. if N'^e^^ ^ in which </> is a constant, the entries 
g*^^-* of the Qg matrix are 



a. 



'-a,(f)-0E[i-(i-y)i?i-|e|r(i-r)i«i-i] ife = '/ 

otherwise, 
in which ^ < r/ denotes a merger of at least three equivalence classes of ^. Finally, if -/V^£„ — > 00 
then large offspring number events occur quite frequently, and only one Q matrix is obtained, 
whose entries are 

{E[r'=(i-r)i«i-'=] if C ^ ?7, 2 < fc < 1^1 

_E[i-(i-r)i«i-|c|r(i-r)i«i-i] ifC = v (34) 
otherwise. 

7.3 Eigenvectors of the rate matrix associated with At 

The eigenvectors of the rate matrix associated with either At or At under the Kingman coales- 



cent can be obtained in closed form (Tavare 19841. The rate matrix associated with At when 



associated with a A coalescent is triangular. The eigenvectors can be computed recursively as 
follows. Let ^e^) = ■ ■ ■ i^i*^') and r^'^) — {r[''\ . . . ,r^J'^) denote the left and right eigenvec- 

tors, respectively, corresponding to eigenvalue — q^ ^. Then i^p — 5u, i'p^ = if j > fc, 
= 1 and, writing = 

^ «^+i^5i±:::±^M^, i<j<k. (35) 

Considering the right eigenvectors we have r'^^ = (1, . . . , 1), r''^^ = 1, r''^^ = if j < fc, and 

rW^ '^-"'" l<fc<j<n (36) 

One confirms that, for sample size two, ^(i) = (1,0), f'^) = (-1,1), A'^"> = (1,1), r^^) = (0,1), 
yielding P[At = l|Ao = 2] = 1 - P[A = 2|Ao = 2] = 1 - e*«2 . 

The eigenvectors of the rate matrix associated with At when associated with a A coalescent 



can similarly be obtained recursively (Equations 35 36l, with i^^^ — Sqj, and r'"' = (1, . . . , 1). 
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7.4 The subsample coalescent process 

In this section wc give the generators associated with the coancestry coalescent process, when the 
ancestry of a subsample is considered jointly with the ancestry of the sample. The convergence 
follows from the convergence result obtained in the first section of the Appendix. The generators 
will have similar form, though not quite the same, as the one associated with the gene genelogical 
coancestry process discussed in the main text. 

Let i?i and R2 denote the sample and the subsample process, respectively, on E. To specify 
the joint process {Ri{t),R2{t);t > 0), denote by ^ = (^1,^2) and rj = (?7i,?72) € the states of 
the joint process; with and rn denoting the states of Ri for i e {1, 2}. By ^ ^ r/ we denote the 
transition 

[6 -< vi, 6 = m- 

where ^1 -< rji denotes a merger of two classes of ^1. By ^ < 77 we denote the transition 

Ci < vi, 6 < m 
^i<m, 6 = »?2 

in which ^1 =< r]i denotes a merger of three or more classes of r]i, which then can involve more 
than two classes (^2 =^ '72)) exactly two (^2 ^ t?2)5 or at most one (^2 = 1I2) class of the subsample 
process. 

In discrete time, the joint process {Riif), R2{m))^^j^ is a Markov chain with transition 
probabilities Pj^^(A^) determined by 

{Nh (1^1). ^ ^ 

in which U^''^ = {U)k + {U)k-i- The results of previous sections now give convergence of 
finite-dimensional distributions of {RiUtT^ \) , R2iltT^ \)) to i?2(i)). If -/V^£« large 

offspring number events are negligible and the rate matrix of {Ri{t), R2{t)) associated with 
model i & {1,2} has entries 

C„^ai^b^|2-K^ C^^- 0<i<2A|6| 

(38) 
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in which are given in Equation (31 ). If -/V^£„ — > where (/) > is a constant, the rate matrix 



associated with model i has entries 



e^?7, 0<j<2A|6l 

<?!>E[y'=(i ~ ''^''''^ijiy''^''^^' 3<fc<|ei|, 0<j<fcA|e2| 

t-a, - 0E[i - (1 - - - e = 



(39) 

If -/V^Ejy ^ large offspring number events are dominating and only one rate matrix is possible, 
whose entries are 

0<i</cA|6|; 



(40) 



E[l - (1 - F)l«il - |Ci|F(l - F)l«il-i] e = 
7.5 The conditional distributions of Ai and A2 

In this section we obtain the conditional distribution of the number of ancestors A2 of the 
subsample, given the number of ancestors Ai of the sample. The conditional distribution of A2, 



and the results of section ( 7.3 1 then yield the joint distribution of Ai and A2. The results for the 



subsample process are straightforward extensions of those obtained for the Kingman coalescent 



(Saunders et al. 19841. To keep notation in line with Saunders et al. (19841, let Ai{t) denote the 
Markov chain counting the number of distinct ancestral lineages at time t > before the present 
of a set of lineages drawn from the population at time zero. Let A2{t) denote the corresponding 
number for a subset of the lineages drawn at time zero. Define Ti{a) = inf {t : Ai{t) — a}, 
T2{a) = mi{t : A2{t) = a}, Al{a) = Ai{T2{a)), and A^{a) = A2{Ti{a)). Given Ai{t) = a, 
denote by /3{a,a — k) the probability that the next merger involves fc + 1 active ancestral lineages 
of the sample. The process {A2{a — ^i), £1 = 0, . . . , a — 1) is a Markov chain with transition 
probabilities 

f[A*2ia, -k) = a, - j|A;(aJ = aj = ^'^'][}^'^'^ (41) 

\k + V 

for 1 < J < min(A;, — 1), and 1 < fc < — 1, and marginal probabilities 

P[A;(aJ = aJA*(aJ = aJ = l, I < a, < a,. (42) 
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The transition probability (41) is not a function of the coalescence rates, since we condition on 



a merger of fc + 1 lineages. However, the conditional distribution 

of A2, given Ai, will be, due to the presence of multiple mergers. Write </>(•, •) — (t>i,j{-, ■) for ease 
of presentation. The conditional distribution of A2, given Ai, can be obtained as a recursion, 

i — a k/\j / ^a^+i^^ ^a-^+k — a^—e^^ 



fa-^+k\ 
\ k+1 J 



k=l i=\ \ \ 

(43) 

+ 0(a, +fc,aj -—^ 

V fc+l / / 

with the boundary conditions = 1 and 1) = 1 for all i > 1. By way of example, 

01, 1(1,1) = 02a(2,l) = (/'2, 1(1,1) = 1 and ?!)3,2(2,1) = /?(3,2)/3. The joint distribution of 
{Ai{t),A2{t)) is 

P[Ai(t) = ai,A2(i) = a2|Ai(0) = 1,^2(0) =j] = 5z,a, (*)</-»,, (a^, a J (44) 



in which (j)i^j{a-^^,ar^) is given in Equation (43 1 



The distribution of A\{-) can be given via a simple recursion, 
P[A*(aJ = aJAi(0)=i,A2(0)-j] = 

i — ai feA(j — 02) /a^ +^Waj +fe — — ^\ 

5]/?(a,+fc,aJ ^ ^,_^.(a^+fc,a^+^)ii±iA_i^^. (45) 

k=l i=l \ k + 1 ) 

7.6 The subsample coalescent process including mutation 

In this section we give the generators associated with the coancestry coalescent process including 
neutral mutation. Let Ri and R2 denote the coalescent processes for the sample and the sub- 
sample, respectively, including mutation. As we are assuming an infinite allele/sites mutation 
model, mutation results in 'new' alleles, and the probability of recurrent mutation is ignored. 
The joint process takes values in E'^, in which the transition ^ ^ ry, as previously, denotes a 
transition due to mutation; 

^ ^ rj = < (46) 
27 



The first line in ( 46 ) denotes the transition in which the mutated Hneage does not belong to the 



subsample, while the second line means that the mutated lineage does belong to the subsample. 
The set E denotes the space of all equivalence relations on {1, . . . ,n} in which each relation ^ is 
partitioned into a set of new and old equivalence classes (Mohle 1999[ ). 

The joint ancestral process [R{k); fc e N) = (^Ri{k), R2{k)) ^^^^ is a Markov chain on with 
transition probability P^^^(A^) given by, in which m denotes the number of classes that mutate 
and m + 6 is the number of old classes of ^, 



(47) 



in which 2 < fc < |^i|, and < j < A: A |^2|- The joint process R ([tT/vJ) associated with model 



If N^e^ large 



£ e {1,2} converges weakly to R{t) with generator Qg 
offspring number events are negligible and the generator associated with model £ G {1,2} has 
entries 



e/2 



(l«l|)2 



C^^, 0<j<2A|e2| 



(48) 



in which the constants are given in Equation (31). If -/V^e^, 
the generator associated with model £ has entries 

'e/2 



(/.E[r'=(l - y)ICi|-fc] (kil-felK-.(fel), 



where > is a constant, 

^^V, 0<j<2A|6l 
^<V. 3<fc<|a| 



l-ieiif-CI^')^ 



(l_y)l6l_|^^|y(l_y)l€i|-i] ^ = ^ 



l«i|-ii 



(49) 



If N'^s,^ — >■ 00 large offspring number events are dominating and the single resulting generator 
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has entries 



E[F^(1 - y)lgil-2j (ISil-IS2n2-j(IS2l)j 



(50) 



6»/2 ^ ^ ?7 

^^V, 0<i<2A|6| 

-j^^^ — i<v, 3</c<ieii 

[-laif - E[i - (1 - y)i«ii - |ei|y(i - y)i«^i-i] ^ = v 

7.7 The conditional distributions of Al and 

The conditional distribution of ^2(")) given initial subsample size j from initial sample size i, is 
obtained similarly as the conditional distribution of A2(-). Define 



4>i,j{aua2) = P [A*2{ai) = a2\A2{0) = i,ii(0) = i] , 

let A(a) denote the transition rate of the sample process Ai given a active ancestral lineages, 
and let I3{a + k, a) denote the probability of coalescence of fc + 1 out of a + fc active ancestral 
lineages. The forward equations of •) — (f>ij{-, •) arc given by 



4>{ai,a2) = ^(ai + 1,02) ( ^{ai + l,ai) 



(oi - 02 + l)(ai + 02) {ai + l)9/2ai- a2 + l 



(oi + l)ai 



+ 



A(ai + 1) 1 + ai 



-,w 0/ . X (02 + 1)02 (ai + l)6'/2 02 + 1 

+ (^ai + l,a2 + l /3ai + l,ai ^' + \ . '' — r 

' (ai + l)ai A(ai + 1) ai + 1 



^ (^(ai + A:,a2)/3(ai + A, oi)— 

fe=2 I fe+1 j 



E 

fe=2 



02 ("i+'^~"2~\ 

(ai + fc, a2);8(ai + fc, ai) 



i-ai kf\{j-a2) 



+ E E + 02 + ^)^(01+ A;, ai) ^ ^ '^"^ ' 



fc=2 ^=2 



V k+1 ) 



(51) 



with boundary condition 4>i^k{h k) = Skj- 
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The conditional distribution of A* can be obtained similarly. Define, 

iJi,j{aua2) = P [Al{a2) = ai|ii(0) = ^,^2(0) = j] . (52) 
The forward equations for •) = •) are given by 

^(a„ a.) = ^(a. + + 1) Ua. + 1, «i) + %^ ^) 

V (ai + Ijai A(ai + 1) ai + 1/ 

(53) 

i-ai kA{j-a2) /a2+i\ /ai+k-a2-e\ 

fe=2 ^=2 V fc+1 / 



with boundary conditions 'tpi,j{i,j) = V'i,i(') ') = 1- By way of example, 



V;2,i(0,0) = /3(2,1) + ^ (54) 
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Table 1: An approximation of the probability hj(t) (17 1 that j of the original lines in the 
population are still segregating at time i = 1 for different values of 6 and /3^. Results for the 



usual Kingman coalescent are obtained when 



1. 



e 


I3e 





j 

1 


> 2 


1 


1 


0.036 


0.320 


0.644 




2 


0.133 


0.654 


0.213 




5 


0.273 


0.718 


0.009 


5 


1 


0.556 


0.379 


0.065 




2 


0.719 


0.270 


0.011 




5 


0.836 


0.164 


0.000 
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Table 2: The probability -0^.2(1, 1) (Equation 18 1 of two lineages sharing a most recent common 
ancestor with the i lineages, varying over i and tt. When < tt < 1, tt denotes tJj of the point 
mass process; tt denotes a of the beta coalescent when 1 < tt < 2, and finally the vector (a, f3) 
of the two-parameter beta coalescent. 

i TT -0^,2(1,1) 



500 


0.005 


0.330 




0.5 


0.065 


1000 


0.005 


0.328 




0.5 


0.048 


500 


1.05 


0.178 




1.5 


0.266 


1000 


1.05 


0.163 




1.5 


0.261 


500 


(1.0,1.0) 


0.061 




(1.0,5.0) 


0.122 


1000 


(1.0,1.0) 


0.041 




(1.0,5.0) 


0.105 
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Table 3: The probability u(1000, 2) that the oldest allele of a sample is among two lineages 
drawn from a sample of size 1000. Cases of tt < 1 refer to the point mass process, of 1 < tt < 2 
to the beta coalescent, and tt = (a, (3) refers to the two-parameter beta coalescent. 



e 


TT 


u(1000,2) 


1 


1.05 


0.506 




1.5 


0.598 


5 


1.05 


0.111 




1.5 


0.189 


1 


0.05 


0.003 




0.5 


0.071 


5 


0.05 


0.002 




0.5 


0.008 


1 


(1,1) 


0.148 




(1,5) 


0.010 


5 


(1,1) 


0.022 




(1,5) 


0.003 


1 


(5,1) 


0.354 


5 


(5,1) 


0.071 


1 


(5,5) 


0.084 


5 


(5,5) 


0.010 
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Table 4: The mean E[Fj] and variance V[Fj] of the number Fi of the oldest allele in a sample 
of size i = 30 varying over 6 and tt. The corresponding values for the Kingman coalescent are 
E[Fi] = 15.500 and Y[Fi] = 74.91 7 for 6 = 1, and E[Fi] = 5.833 and Y[Fi] = 20.139 for (9 = 5. 



e 


TT 




V[F,] 


1 


1.05 


14.834 


93.625 




1.5 


15.026 


84.975 


5 


1.05 


5.112 


34.612 




1.5 


5.269 


26.698 


1 


0.05 


1.061 


0.096 




0.5 


5.182 


38.028 


5 


0.05 


1.012 


0.018 




0.5 


1.824 


5.995 


1 


(1,1) 


7.792 


67.565 




(5,1) 


13.570 


96.908 




(1,5) 


1.845 


5.027 




(5,5) 


5.752 


45.621 


5 


(1,1) 


2.630 


18.001 




(5,1) 


4.923 


44.402 




(1,5) 


1.163 


0.826 




(5,5) 


1.978 


8.200 
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Figure 1: The probability hio{t) that i Uneages have vanished from the population by time t 
as a function of time for i = 20 and varying over 6 and the coalescence parameters tjj {&), a (b), 
and (a, /3) (c) as shown in the legends. 

(a) V-coalescent 



X ++++'^ o°°° 
^ ++ o° 
^ o 

+ 


+ 

+ o 






+ 

o 








o (0.5,0.05) 
A (5, 0.05) 
+ (0.5, 0.5) 




A + 

o 


X (5,0.5) 




+ o 

_i_ «o 
TffOOOOO 







5 10 15 20 25 30 



(b) beta coalescent 



1.0 - 


& & A & A 












0.8 - 














0.6 - 
0.4 - 




{6, a) 






o (0.5, 1.05) 
A (5, 1.05) 
+ (0.5, 1.5) 
X (5, 1 .5) 




* 







2 4 6 8 10 



0.2 



(c) beta(Q;, (3) coalescent 

. + + + t 



X o (0.5,0.5,0.5) 

° A (5, 0.5, 5) 

* + (0.5, 5, 0.5) 

0.4 - X (5, 5, 5) 



time t 
35 



Figure 2: Expected number E[j4*(aJ = aJA^ = 150, ^4^ = 50] of lineages ancestral to the 
subsample as a function of when the population has the a-coalescent with a and the mutation 
rate 6 varying as shown in the legend. Right panels are for the two parameter beta coalescent. 




Figure 3: Expected value E[A*{aJ = aJA^(O) = 150,^2(0) = 50] as a function of the 
number of ancestral lineages belonging to the subsample varying over tt and 6 as shown in 
the legends. By K we denote the Kingman coalescent. 
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Figure 4: The probability ii(150, j) that the oldest allele of a sample is among j lineages as 
a function of j from 1 to 50 and varying over (tt, 0) as shown in the legends (in the same form 
as in Figure 3). By K we denote the usual Kingman coalescent. 
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